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Abstract 



In this paper we consider a continuous-time stochastic volatility 

a model. The model contains a stationary volatility process, the density 

of which, at a fixed instant in time, we aim to estimate. We assume 
* * i 
^- , that we observe the process at discrete instants in time. The sampling 

times will be equidistant with vanishing distance. 

A Fourier-type deconvolution kernel density estimator based on the 
logarithm of the squared processes is proposed to estimate the volatil- 
ity density. An expansion of the bias and a bound on the variance are 
derived. 

Key words: stochastic volatility models, density estimation, kernel es- 
timator, deconvolution, mixing 
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1 Introduction 

Let S denote the log price process of some stock in a financial market. It 
is often assumed that S can be modelled as the solution of a stochastic 
differential equation or, more general, as an Ito diffusion process. So we 



assume that we can write 

dS t = b t dt + a t dW t , S = 0, (1) 



or, in integral form, 



S t = [ b s ds+ [ a s dW s , (2) 

Jo Jo 



where W is a standard Brownian motion and the processes b and a are 
assumed to satisfy certain regularity conditions (see Karatzas and Shreve 
(1991)) to have the integrals in (|2|) well-defined. In the financial context, 
the process a is called the volatility process. 

In this paper we model a as a strictly stationary positive process sat- 
isfying a mixing condition, for example an ergodic diffusion on [0, oo), and 
we make the assumption that a is independent of W. We will assume that 
the one-dimensional marginal distribution of a has a density with respect to 
the Lebesgue measure on (0, oo). This is typically the case in virtually all 
stochastic volatility models that are proposed in the literature, where the 
evolution of a is modelled by a stochastic differential equation, mostly in 
terms of a 2 , or logo -2 (cf. e.g. Wiggins (1987), Heston (1993)). 

For stochastic differential equations of the type 

dX t = b(X t )dt + a(X t )dB t , 

with Bt equal to Brownian motion, the invariant density is up to a multi- 
plicative constant equal to 

... i .^(,r«2L*\ ( 3) 



a 2 (x) v Jxo ° 2 (y) 

where xo is an arbitrary element of the state space (I, r), see e.g. Gihman 
and Skorohod (1972) or Skorokhod (1989). From formula (0) one sees that 
the invariant distribution of the volatility process (take X for instance equal 
to a 2 or logo -2 ) may take on many different forms, as is the case for the 
various models that have been proposed in the literature. This observation 
supports our point of view that nonparametric procedures are by all means 
sensible tools to get some insight in the behaviour of the volatility. 

In the present paper we propose a nonparametric estimator for the 
volatility density. Using ideas from deconvolution theory, we will propose 
a procedure for the estimation of the marginal density at a fixed point. 
We will assume that we observe the log-asset price S at time instants 
0, A, 2A, . . . , reA, where the time gap satisfies A = A ra — ► and nA„ — > oo 
as n — ► oo. To asses the quality of our procedure, we will study how the 
bias and variance of the estimator behave under these assumptions. 

The remainder of the paper is organized as follows. In the next section, 
we give the heuristic arguments that motivate the definition of our estimator. 



In Section the main result concerning the asymptotic behaviour of the 
estimator is presented and discussed. The proof of the main theorem is 
given in the last two sections. 

2 Construction of the estimator 

To motivate the construction of the estimator, we first consider (|l|) without 
the drift term, so we assume to have 

dS t = a t dW t , S Q = 0. 

It is assumed that we observe the process S at the discrete time instants 0, 
A, 2A, . . . , nA. For i = 1, 2, . . . we work, as in Genon-Catalot et al. (1998, 
1999), with the normalized increments 

x i = 7^ (SiA - %-i)a)- 
For small A, we have the rough approximation 



x A = 4^ 



^= / a t dW t « cr {i _ 1)A -^(W iA - W {i _ l)A ) = a {i _ 1)A zt, 



'(i-l)A 
where for i = 1, 2, . . . we define 



zf = -L(w lA -w { ^ 1)A ) 



By the independence and stationarity of Brownian increments, the sequence 
Z±, Z2, ••• is an i.i.d. sequence of standard normal random variables. More- 
over, the sequence is independent of the process a by assumption. 
Taking the logarithm of the square of Xf' we get 

log(pQ A ) 2 ) « log(4_ 1)A ) + log((Z A ) 2 ), (4) 

where the terms in the sum are independent. Assuming that the approx- 
imation is sufficiently accurate we can use this approximate convolution 
structure to estimate the unknown density / of log(<r J 2 A ) from the observed 
log((A A ) 2 ). 

Before we can define the estimator, we need some more notation. Ob- 
serve that the density of the 'noise' log(Z A ) 2 , denoted by k, is given by 

k{x) = -^=e\ x e~\ e \ (5) 

V2^ 



The characteristic function of the density k is denoted by fa ■ 

We will use a function w, satisfying the following condition. For exam- 
ples of such kernels see Wand (1998). 

Condition W. Let wbea real symmetric function with real valued sym- 
metric characteristic function 4> w with support [-1,1]. Assume further 



1- /f^j \w(u)\du < oo , ff^ w(u)du = 1 , J^ u 2 \w(u)\du < oo , 

2. <£ w (l - i) = At a + o(t a ), as t i for some a > 0. 

Following a well-known approach in statistical deconvolution theory, we 
use a deconvolution kernel density estimator, see e.g. Section 6.2.4 of Wand 
and Jones (1995). Having the characteristic functions (j)k and <j> w a t our 
disposal, choosing a positive bandwidth h, we introduce the kernel function 

and the density estimator 

«*) = ^ £ <* ^ s J • ( 7 ) 

One easily verifies that the function Vh, and therefore also the estimator f n h, 
is real- valued. 

3 Results 

To derive the asymptotic behaviour of the estimator, we need a mixing con- 
dition on the process a. For the sake of clarity, we recall the basic definitions. 
For a certain process X let J-\ be the <r-algebra of events generated by the 
random variables X%, a < t < b. The mixing coefficient a(t) is defined by 

a(t)= sup \P(AnB)-P(A)P(B)\. (8) 

The process X is called strongly mixing if a(t) — ► as t — > oo. 

As we mentioned in the introduction, it is common practice to model 
the volatility process V = a 2 as the stationary, ergodic solution of an SDE 
of the form 

dV t = b(V t )dt + a(V t )dB t . 

The mixing condition that we use in Theorem EO below is satisfied in this 
setup. See for instance Corollary 2.1 of Genon-Catalot et al. (2000). 

It is easily verified that for such processes it holds that E | Vt — Vq \ = 
Oit 1 ' 2 ), provided that b G Li(/j,) and a G ^(/j), where \x is the invari- 
ant probability measure. Indeed we have E|Vj — Vb| < E f Q \b(V s )\ds + 
(E J Q a 2 (V s )ds) 1 ' 2 = t\\b\\ Ll ^ + Vi\\a\\ L2 ^. Although we will not assume 
explicitly that a 2 solves an SDE, the above observation motivates the fol- 
lowing condition. 

Condition a. We have E \a 2 - cr§ | = 0{t 1 / 2 ) for t -> 0. 



The following theorem describes the asymptotic behaviour of our esti- 
mator f n h. Note that it also covers the case where there is a drift bt present 
in equation (|l]). The condition on the drift is boundedness of E6f. This 
condition is typically satisfied in realistic models for the log-returns of a 
stock, since bt is the local rate of return and this will be mostly bounded 
itself. 

Theorem 3.1. Assume that E6f is bounded. Let the process a be strongly 
mixing with coefficient a(t) satisfying, for some < q < 1, 

oo 

a(t) q dt < oo, 
o 

and suppose that Condition a holds. Let the kernel function w satisfy Con- 
dition W and let the density f of log of be continuous, twice continuously 
differ entiable with a bounded second derivative. Also assume that the density 
of of is bounded in a neighbourhood of zero. Suppose that A = n for given 
< 5 < 1 and choose h = 771"/ log n, where 7 > 4/o\ Then the bias of the 
estimator (W satisfies 



E>fnh(x) - f{x) = \h 2 f"(x) j u 2 w{u)du 



o(h 2 ). (9) 



Moreover, the variance of the estimator satisfies 



o(-h^ e 'l h \+o(—^ 



^M*)-o(-t '^"-) +0 {m^a)- (10) 



The proof of the theorem is deferred to the next section. We conclude 
the present section by a number of comments on the result. 

Remark 3.2. The expectation of the deconvolution estimator is equal to 
the expectation of an ordinary kernel density estimator, as becomes clear 



from the proof of Lemma 4.1 



It is well-known that the variance of kernel-type deconvolution estima- 
tors heavily depends on the rate of decay to zero of \4>k(t)\ as |i| — > 00. The 
faster the decay the larger the asymptotic variance. In other words, the 
smoother k the harder the estimation problem. This follows for instance for 
i.i.d. observations from results in Fan (1991) and for stationary observations 
from the work of Masry (1993). 

The rate of decay of |</>fc(£)| for the density (g) is given by Lemma 
5.1 in Section IBl stating !</>&(£) I ~ v2e~2 7r l 4 l j as |£| — ► 00. This shows 



that k is supersmooth, cf. Fan (1991). By the similarity of the tail of this 
characteristic function to the tail of a Cauchy characteristic function we can 
expect the same order of the mean squared error as in Cauchy deconvolution 



problems, where it decreases logarithmically in n, cf. Fan (1991) for results 
on i.i.d. observations. Note that this rate, however slow, is faster than the 
one for normal deconvolution. Fan (1991) also shows that we cannot expect 
anything better. 

Remark 3.3. The choices A = n , with < 6 < 1 and h = jir/logn, with 
7 > 4/<5 render a variance that is of order n _1+1 ' 7 (l/logn) 2a for the first 
term of (|i~0| ) and n~ 1+ (logn) 1+q for the second term. Since by assumption 
7 > 4/<5 we have I/7 < 5/4 < 5 so the second term dominates the first term. 
The order of the variance is thus n~ 1+ (logn) 1+9 . Of course, the order of 
the bias is logarithmic, hence the bias dominates the variance and the mean 
squared error of f n h(x) is also logarithmic. 

Remark 3.4. Better bounds on the asymptotic variance can be obtained 
under stronger mixing conditions. Consider for instance uniform mixing. In 
this case the mixing coefficient (ft(t) is defined for t > as 

<j>{t) = sup \P(A\B) - P(A)\ (11) 

and a process is called uniform mixing if <p(t) — > for t — > 00. Obviously, 
uniform mixing implies strong mixing. As a matter of fact, one has the 
relation 

a(t) < \4>{t)- 

See Doukhan (1994) for this inequality and many other mixing properties. 
If a is uniform mixing with coefficient <f> satisfying J 00 <f>(t) ' 2 dt < 00, then 
the variance bound is given by 

Vta/ rt (,)-0(I»»V/») + 0(-L). (12) 

The proof of this bound runs similarly to the strong-mixing bound. The 
essential difference is that in equation (H3) we use Theorem 17.2.3 of Ibrag- 
imov and Linnik (1971) with r = instead of Deo's (1973) lemma, as in the 
proof of Theorem 2 in Masry (1983). 

Remark 3.5. Smoothness conditions on the density at each time of the so- 
lution of a stochastic differential equation are guaranteed under Hormander's 
condition, see Theorem 2.3.3 in Nualart (1995). Recall also relation (y), 
which can be used to relate the smoothness of the invariant density to the 
smoothness of the drift and diffusion coefficients. 



4 Proof of Theorem [371] 

We give the proof under the additional assumption that bt = 0. The general 
case is an easy consequence. Let T a denote the sigma field generated by 
the process a and let f n h denote the estimator based on the approximating 
random variables a/j_ 1 \^Z^ L , written as Xj, i.e. 

/«*(«) =^ em — i — )• (13) 

The proof of @ follows from the following two lemmas, whose proofs are 
given in the next section. The first one deals with the expectation of f n h- 



Lemma 4.1. We have 

Ef nh (x) = ~j w(^-)f(u)du. (14) 

The second lemma estimates the expected difference between / n /j and 
f n h- The bound is in terms of the functions 

and 

/, s Iff/A , 1 CkI + k/\x\\ \ + -k/\x\ 
7i(/t,x) =e2^ /ft + -exp(^- J log . (16) 

Lemma 4.2. For h — ► and e small enough we have 



( 1 A x /4 i a x /2 e 

°l T2 7o(/i)— _ + r7o (fc)-_+ 71 (^|log2e|//i) 1 



e /i e 2 | log 2e| / 

Notice that the equality (|14| ) is the same as for ordinary kernel estima- 
tors, see for instance Wand and Jones (1995). Statement @ of the theorem 
then follows by combining standard arguments of kernel density estimation 



and Lemma 4.2. We will show that the bound in Lemma 4.2 is essentially 
a negative power of n, whereas h 2 is of logarithmic order. Recall that we 
have assumed 5 > 4/7. It follows that I/27 < (5/4 — I/27, so we can pick a 
j3 6 (I/27, 5/4 — I/27) and take e = n~" . Up to factors that are logarithmic 
in n the order of \E f n h(x) — E/„/j(x)| is then 

n ^-¥+P + n ^+ 2 P- S 2 +n ^ _/3 , (17) 



which is negligible to h? = 7 2 7r 2 /(log n) 2 for the chosen values of the param- 
eters. 

To prove the bound ( JTOJ ) we use the two lemmas below, which are 
proved in the next section. First consider the variance of f n h(x). 

Lemma 4.3. We have, for h — ► 0, 

Vto/ rt (,) = 0(itf"e'/») + 0( ss L s ). (18) 

The next lemma estimates Var (f n h(x) — f n h( x ))- 
Lemma 4.4. We have, for h — > and e > small enough, 
Var(/ nh (x) - fnh{x)) 

= o(^ la {h) 2 ^ + -ii(hMog2e\/hf-^) (19) 

Knn^ e z n |log2q z / 

1 /A^-?)/ 2 

^A°V h 2 e 2 

The proof of ( JT~0| ) is finished as soon as we show that the estimate in 



" «o(i^^ H )' P°) 



Lemma [4.4| is of lower order than the one in Lemma 4.3. Up to terms that 
are logarithmic in n, the bound in Lemma 4.3 is of order n . Choosing 
again e = n~@, up to logarithmic factors, the order of Var (f n h(x) — f n h( x )) 
is 

n -l+i-f +2/3 + n -l+±-f> + n -l + 2/3 + ^ 5 + n -l +S -0(l- q) _ (21) 

Recall our assumption ^7 > 4. If we pick /3 less than I<5(1 — g), then all 
these terms are indeed of lower order than n . □ 



5 Technical lemmas 

5.1 Analytic properties 

We need expansions and order estimates for the functions (/>&, the kernel Vh, 
as defined in (@), 70, as defined in ( |l5[ ) and the function 71, as defined in 



(16). These are collected in the lemmas of this subsection. 
Lemma 5.1. For \t\ — > 00 we have 

\Mt)\ =v / 2e-H*l(l + 0(^) ) . 
Proof. The characteristic function of k is given by 



: {t) = ^=2 it Ta + it). (22) 



vr ' 2 



The result follows by applying the Stirling formula for the complex gamma 
function, cf. Abramowitz and Stegun (1964) Chapter 6. □ 



Lemma 5.2. We have the following order estimate for the I? norm of v^- 
Forh^O 



)• 



|| Vh || 2 = 0(hi +a e"/ 2h 
Proof. By Parseval's identity 

i r 1 1 w (s) 2 



(23) 



\ v h\\l 



27T J_ t l (/) k (s/h) 

The integral on the right-hand side is bounded by 



ds. 



\j \Ms)\ 2 e nWhl ds + J \M*)\' 



1 



\Ms/hW 



l e ir\s/h\ 



ds 



The first term in (E4J) can be rewritten as 



ir/hiA+2a 



X/h 



4> w (l - hv) 



(hv) a 



v 2 " e~™dv ~ e" /h h 1+2a A 2 



v 2a e~ nv 



(24) 



dv, 



by the dominated convergence theorem. We can rewrite the second term in 



(24) as 



2h 1+2a e n/h 



l/h 



[l-hv)\ 



(hv) a 



2 e —rr(l/h-v) 



<> k (l/h-v)\< 



v 2a e~™dv, 



which is of order 0{h 1+2a e K ' ) by the dominated convergence theo- 
rem. We have used the fact that both the functions <f> w (l — u)/u a and 
|(2exp(— 7ru)/\4>k(u)\ 2 ) — 1| are bounded and that the second function is of 
order 0(l/u) as u tends to infinity. This shows that the second term ( |24] ) 
is negligible with respect to the frist one. □ 



Lemma 5.3. For Ii->Ok have 

10 (h) = o(h l+a e^ h ). 
Proof. The proof is similar to that of Lemma |5.2|. 



(25) 
□ 



Lemma 5.4. The functions v^ are bounded and Lipschitz. More precisely, 
for all x we have \vh(x)\ < jo(h) and for all x and u 



\v h (x + u) - v h (x)\ < 7o(/i) \u\ 



(26) 



Proof. The bound for |«/i(x)| is obvious. To prove (2£) write 

■ i 



\v h {x + u) -v h (x)\ < — 



<t>k{s/h) 



1| ds < 7o(/i)|u|. 



D 



Lemma 5.5. For x — ► oo we have the following estimate on the behavior of 
Vh- For some positive constant D it holds that 



and 



\"Vh{x)\ < i — Tli{h,x) as \x\ — > oo, 
\x\ 



ll{Kx) = o(l^i ei'Ci+ir/N)/*) as ft _> o. 



(27) 



(28) 



Proof. By a bound in the proof of the Riemann Lebesgue lemma on page 
402 of Hewitt and Stromberg (1965) we have, with y = tt/x, 



\Vh{x) 



< 



< 



1 

2^ 

i r°° 

^ J-oo 

1 /"OO 

2^ 



oo 



oo 4> k (s/h) 



(s + y) 



4>k{s/h) (f> k ((s + y)/h) 
(f> w (s) -<j) w (s + y) 



ds 



4>k(s/h) 



ds 



1 f°° 
+ 2tt / ^ s + y ' 



1 



1 



4> k (s/h) (f) k ((s + y)/h) 



ds. 



(29) 
(30) 



First we need a bound on the integral (P9[). Since it follows from Conditions 
W that 4> w is Lipschitz (the proof is similar to that of ([2g)), with Lipschitz 
constant C\ say, we have 



j — c 



,(s) -<p w (s + y) 



< 2Ci- 



(j) k {s/h) 
1 



ds<Ci 



1 



l0fc(«A)l 



ds|y| 



= (WI "" V2 



^-e^M, 



by Lemma 5.1. To bound the integral (pQ) we need an estimate on the 



behaviour of |</4l/|</>fc| 2 - Recall the expression (22) for cp k - Hence, with 
^ = r'/r the digamma function, 



\Mt)\ 



;\og2e ino z 2 T{\ + it)+ie in °z 2 T'{\ + it) 



< —(\ og 2\T{\ + it)\ + \T'{\ + it)\ 



and, as |i| — > 00, 



<^) 2 



< v^ t * (log 2 + M\ + it)\) < 40Flog(|t|)eH*l, (31) 

r(~ + zi v 2 / 



10 



by Lemma O] and by the expansion |^(z)| ~ logz for z — ► oo, |Argz| < 7r, 
cf. Abramowitz and Stegun (1964), Chapter 6. We now turn back to the 
integral (pOh and write 



<t>w{s + y) 

»1 



1 



4>k(s/h) <p k ((s + y)/h) 
1 1 



ds 



i 



< - 

- h 

2 

< - 
~ h 



<t>k{(s-y)/h) 4>k{s/h) 



ds 



sup 



(-l-|»l)/h<«<(l+l»l)A 
sup 



\y\ 



h 



in view of (|3l|). This completes the proof. 



4^Flog(|s|)e^l s l|y| 
(-l-[vl)/h<»<(l+[vl)/>> 

7rlog((l + |i/|)/^)e5 ,r ( 1 +li'l)/' , |i/| 



D 



5.2 Proof of lemmas |4.1|44.4 



We start with the proof of Lemma 4.1. Recall that J- a is the er-algebra 
generated by the process a. 



Proof of Lemma 4.1. Write 



V<J nh {x)\T ) = -rj^vfvh 



nh 



x-log<7 2 _ 1)A -log(Z t A ) 2 



t=\ 



h 



\Tn 



1 - 1 

— Y — 

nh ^^ 2-7T 
t=i 



1 n 1 

— V — 

nh *-** 2ir 

t=\ 



oo 


4>w(s) 


-oo 


4>k{s/h) 


oo 


<t>w(s) 



E 



3 <Pk(s/h) 
"tEW ^(^) e-^- logCT (-)A)/^ s 



/ e -is(x-loga2 t _ 1)A -log(Zf) 2 )/h|^\ d g 



nh 



t=i 






x-logaf t _ 1)A 



By taking expectation the result follows. □ 

For the proof of Lemma [4.2| we need a few properties of the process a, 
valid under Condition a. Since (x — y) 2 < \x 2 — y 2 \ for x,y > 0, it holds 
that E (at — ao) 2 = Oft 1 ' 2 ) for t — > 0. Consequently, there exists a constant 
C > such that 



E (Xf - (ToZf) 2 < CA 1 / 2 for A -» 0, 



(32) 



11 



since E (X^— <tq Z^) 2 = -^E f Q (a t — ao) 2 dt. Moreover, Condition a implies 
that 



E 



4- / a? dt - a 2 , 



A / "t 

o 



0{k l/2 ) for A -> 0. 



(33) 



Proof of Lemma 4.2| . Writing 
Wj = v h 



x-io g ((xf) 2 ; 



h 



Vh 



x - log(X? 



so that f n h(x) - fnh{x) = -^ Y!j=l w j> we nave 



\Ef nh (x)-Ef nh (x)\ <-^E\Wj\ 
= h E ^ Wj ^i\xt\>e and \Xi\>e] 



+l E \ W j\hx* 



f|< £ or [Xi[<e] i '[[jrA_Xi[>6] 



+ E|W r i |/ [ | X A|< E or IXil^el^lXf-Xi^e]- 



By Lemma 5.4 and (p2[) the term (p4|) can be bounded by 



^ 7o(/i)E | log((X A ) - log(X 1 )|/ [|X A | > £ and ]jti] > e 



(34) 
(35) 
(36) 



In the same way the term fl35|) can be bounded by 

^(h)P(\Xt-Xi\ >e)< l l0 (h)C^. 
Since the absolute value of both arguments of Vh below are eventually larger 



than | log2e|//i, by Lemma 5.5 the term ( p6[ ) can be bounded by 



i 7l (fc,|l g2 g |/M (|log ^ |A) 



P(\X!\ <2e)<C 2 -Yi(h,\log2e\/h)-. 



log 2e| 



for some constant Ci. Here we used the fact that the density of X\ is 



bounded which follows from the assumption that ok has a bounded density 



in a neighbourhood of zero. □ 

Proof of Lemma |4.3| . Consider the decomposition 

Var (f nh (x)) = Var (E (f nh (x)\T a )) + E (Var (f nh (x)\^)). (37) 



12 



By the proof of Lemma 4.1 the conditional expectation K (f n f l (x)\J-' l j) is 



equal to a kernel estimator of the density of log of . By Theorem 3 of Masry 
(1983), we can bound its variance by 



20(1 + o(l)) 
h l +lA 



n 



/(*) 



1-9 



\ 1— (I f °° / 1 



Given the process a the random variables log X 2 are independent, so we can 
bound the second term in (^) by 



n 



1 n 
_^Var(^ 



2 h 2 



x — log X, 



t=\ 



h 



1 

nh 2 



S-l0gXf\\2 1 



fu <-^EK r i :^ v M )"<-^7ow 2 , 



h 



n 



h? 



by Lemma 5.4. The result follows by an application of Lemma 5.3 



□ 



Proof of Lemma [4.4| . Note that for different i,j, conditional on the process 
a, the pairs X^,Xi and X^,Xj are independent. Hence the conditional 
covariances of functions of these pairs vanish. 



With Wj as in the proof of Lemma 4.2 we have 



Var (f n h(x) - fnh(x)) 

_L Var W 1 + J-^ J^Cov (E (W^), E (W^)). (38) 



nh 2 



i¥=j 



Let us first derive a bound on Var Wi. We have Var Wi < E W 2 , which can 
be split up in three terms 



- E Wj I[\x*\> 6 and |Xi|>e] 



±EW?I UYA 



h 

+~Eirf/ ,a 



Ir, 



J J [|Xf |<e or |Xi|<e]- [ [|Xf-X 1 |> e 



/l 



Ir, 



j J [|Xf |<e or |Xi|<e]-'[|X 1 A -Xi|< £ 



(39) 
(40) 
(41) 



By (|32|) and Lemma ^4| the term (|39j) can be bounded by p- 7o(^) 2 C — 3—. 



Again by ( |32[ ) and Lemma 5.4 the term (pC|) can be bounded by 

4 7o (/ l ) 2 P(|X 1 A - Xi| > e) < 4 7o (/i) 2 C ^-. 
Since the absolute value of both arguments of i> ^ below are eventually larger 



than I log2e|//i, by Lemma 5.5 the term (HT1) can be bounded by 



7 i(/i,|log2 £ |//i)- 
(\log2e\/h) 2 



P(\Xi\ < 2e) < C 2 / i 2 7i(/i,|log2e|//i) i 



log2e| 2 ' 
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for some constant C2, where we use again, as in the proof of Lemma 4.2, 
that the density of X\ is bounded. 
We get 

/ 1 A 1 / 2 f \ 

EW ? = °{^^(h) 2 C — + h 2 11 (h,\log2e\/h) 2 y [ -^), (42) 



which gives the first order bound (|19|). 

Next we concentrate on the sum of covariances in rt387) . Define 



1 f iA 
Oi = - a 2 t dt. (43) 

A J(i-1)A 

Note that given T a , X A is A/"(0, cfj) distributed and Xj is N(0, c^_ 1 )a)- As 
in the proof of Lemma |4.1| it follows that 

t? ra; 1 -r ^ /^-logoA ^ x-log^._ 1)A 
B(^)= W (— — J-«;( jr 

We follow the line of arguments in the proof of Theorem 3 in Masry 
(1983). The stationarity of Wj implies that also the conditional expectations 
Wj := E {Wj\Tff) are stationary. Hence we have 

n~\ 

Y, Cov (W h Wj) = 2 £(n - k)Cov (Wo, W k ). 

i^j fc=l 

Now note that the process Wj is strongly mixing with a mixing coefficient 
a{k) < a((k — 1) A), k = 1, 2, . . . , where a is the coefficient of the process a. 
By a lemma of Deo (1973) for strongly mixing processes it follows that for 
all r > 

\Cov(W ,W k )\ < 10a((k- l)A) T /( 2+T )(E|Wi| 2+T ) 2/(2+T) . (44) 

By the monotonicity of the mixing coefficient a we get 



n 2 h 2 



n-l 



fe=l 

* ^H T/(2+T) + i / 00 «(*r /(5HT) *)(Ei^r) a /(2+T) - 
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Next we derive a bound on E |Tyi| 2+T . Fix k £ (0, 1]. We have 



E|Wi| 2+r = E 



w 



x-log(cri) 



w 



X - log (erg) 



E 



E 



?<> 



tr 



h J \ h 

x *[a 1 >e and o-2> £ ] 

X — log(cii)^ (X — log(<7g)\ 2+T 



2 + T 



w 



h J \ h 

xI [ai<e or «rg<e] I [[ffjf-«jg«[>6] 

x-log(ai) 



If! 



X — log((To)N 2+r 



h J \ h 

[at<e or <rg<e] jr [[fff-ffg' 5 [<E]- 



(45) 
(46) 
(47) 



Note that by Condition W and Fourier inversion w is Lipschitz with constant 
1/tt and bounded by 1/V. Hence the term (|45| ) can be bounded by E \a^ — 
&o K \ 2+T I '(neh) 2+T . The term @ can be bounded by 

D/"l=.re „-2rei -^ _\ ^- TTl^-K ^-2ki2+t 

^(l ff i ~ °o I > e ) < ~2T7 E l°i - °b I 



Likewise, the term (|47|) can be bounded by 



E 



w 



log((Tl)\ /X-log(ffg) 



/? 



-■»; 



2+r 



■^[ffx^eCi+e 1 -") 1 /" and <t2<£(i+£ 1 -«) 1 /k] 



which is bounded by -P(o"q < 2e) = O(e) since o"q was assumed to have a 

bounded density in a neighbourhood of zero. 

With r = 2q/(l — q) and k = -^— = — jp we have with an application of the 

basic inequality \u K — v K \ < \u — v\ K for u, v > and k G (0, 1] in the second 

equality below and from condition a and its consequence (|33|) in the fourth 

equality 



n 



1 ^CoviW^W,) 



2 h 2 



i¥=3 

1 



-0 



1 1 



-K 2k 1 2+T 



Tj 1 I — K ^-^K> I ^~T ' I — I 



\ 2/(2+r) 



O 



1 1 



r2 |«(2+.)^ 2/(2+ ^ 



n/i 2 A \h 2+T e 2+T J 



1 



-O 



nh 2 A 

1 /AV(2 
-O 



(Efr-ggl)^) /(2+T) 

/i 2 e 2 + 



nh 2 A \ h 2 e 2 
1 /A^)/ 2 



+ e 



2/(2+r) 



O 



+ e 



1-9 



nh 2 A \ h 2 e 2 
which gives the second order bound (|20|) . 



□ 
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